- Title
- A Novel Deep Learning-Based Approach for Malware Detection
- Creator
- Shaukat, Kamran; Luo, Suhuai; Varadharajan, Vijay
- Relation
- Engineering Applications of Artificial Intelligence Vol. 122, Issue June 2023, no. 106030
- Publisher Link
- http://dx.doi.org/10.1016/j.engappai.2023.106030
- Publisher
- Elsevier
- Resource Type
- journal article
- Date
- 2023
- Description
- Malware detection approaches can be classified into two classes, including static analysis and dynamic analysis. Conventional approaches of the two classes have their respective advantages and disadvantages. For example, static analysis is faster but cannot detect the malware variants generated through code obfuscation, whereas dynamic analysis can effectively detect variants generated through code obfuscation but is slower and requires intensive resources. This paper proposes a novel deep learning-based approach for malware detection. It delivers better performance than conventional approaches by combining static and dynamic analysis advantages. First, it visualises a portable executable (PE) file as a coloured image. Second, it extracts deep features from the colour image using fine-tuned deep learning model. Third, it detects malware based on the deep features using support vector machines (SVM). The proposed method combines deep learning with machine learning and eliminates the need for intensive feature engineering tasks and domain knowledge. The proposed approach is scalable, cost-effective, and efficient. The detection effectiveness of the proposed method is validated through 12 machine learning models and 15 deep learning models. The generalisability of the proposed framework is validated on various benchmark datasets. The proposed approach outperformed with an accuracy of 99.06% on the Malimg dataset. The Wilcoxon signed-rank test is used to show the statistical significance of the proposed framework. The detailed experimental results demonstrate the superiority of the proposed method over the other state-of-the-art approaches, with an average increase in accuracy of 16.56%. Finally, to tackle the problems of imbalanced data and the shortage of publicly available datasets for malware detection, various data augmentation techniques are proposed, which lead to improved performance. It is evident from the results that the proposed framework can be useful to the defence industry, which will be helpful in devising more efficient malware detection solutions.
- Subject
- malware detection; cybersecurity; machine learning; deep learning; transfer learning; ensembling
- Identifier
- http://hdl.handle.net/1959.13/1479951
- Identifier
- uon:50418
- Identifier
- ISSN:0952-1976
- Language
- eng
- Reviewed
- Hits: 1313
- Visitors: 1305
- Downloads: 0
Thumbnail | File | Description | Size | Format |
---|